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Understanding Versinn 2 
01DB2/6000 And DB2/2 

By C. M. Saracco and Charles J. Bontempo 


O bject support, connectivity en¬ 
hancements, parallel I/O proc¬ 
essing and new optimization 
techniques are included in Version 2 of 
IBM’s DB2 products for AIX/6000 and 
OS/2 platforms. Such features are de¬ 
signed to improve the competitiveness of 
these products by providing greater flex¬ 
ibility, improved access to other data 
sources and better overall performance. 

Any discussion of such features is 
bound to raise questions. For example, 
what technology is IBM actually provid¬ 
ing in these areas? What value might 
these technologies bring to users? These 
questions are best answered by describ¬ 
ing each of these technologies and its po¬ 
tential benefits. 

Object Extensions 

Like many vendors, IBM has been 
working on incorporating a variety of 
object-oriented features into its relational 
DBMS products. As shown in Figure 1, 
such features include a more flexible 
type system, capable of accommodating 
multimedia data types as well as user-de¬ 
fined data types; additional integrity 
mechanisms (including triggers and con¬ 
straints) that help make the DBMS more 
active and enable users to associate more 
meaning with their data; and support for 
database access through C++ and 
Smalltalk. These features and others are 
often considered “object” extensions be¬ 
cause they provide some of the capabili¬ 
ties associated with object technology. 

For example, relational DBMSes are 
sometimes criticized for supporting only 
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“record-oriented” data (mainly, format¬ 
ted fields of character strings and nu¬ 
meric data types), while object-oriented 
programming languages enable users to 
create “nontraditional” types, some of 
which may employ an inherently com¬ 
plex internal structure. How are vendors 
such as IBM addressing this? 

DB2/6000 and DB2/2 Version 2 use 
2GB large objects (sometimes called 
“BLOBs”) to support a variety of nontra¬ 
ditional data types such as audio, com¬ 
pressed video, image and documents. A 
single table can contain a number of 
such large-object columns, and users can 
use various SQL functions and operators 
to access data within these columns. 

Support for user-defined types further 
enhances the type system of DB2/6000 
and DB2/2 Version 2. User-defined types 
enable users to define new types based 
on system-supplied types. In doing so, 
they can specify certain operational re¬ 
strictions for these new types to capture 
the behavior applicable for these types. 
For example, an “account number” type 
can be derived from the DBMS’ native 
support for integers. In doing so, users 
can specify that account numbers cannot 
be subjected to arithmetic operations, 
since adding or multiplying two account 
numbers would be considered invalid. 
This helps users capture more of the se¬ 
mantics they inherently associate with 
their data and can help prevent erroneous 
or misleading results. 

A related object extension involves 
support for user-defined functions, which 
enables users to define new SQL scalar 


DB2/6000 and 
DB2/2 Version2 
enable users to 
control the number 
and types of 
techniques considered 
by the optimizer 
when determining 
a reasonable access 
path to the data. 
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functions that can be incorporated into 
standard queries involving both system- 
supplied data types and user-defined data 
types. These functions, coded in a lan¬ 
guage such as C and registered with the 
DBMS through a CREATE FUNCTION 
statement, extend the breadth of opera¬ 
tions supported by the DBMS. If desired, 
user-defined functions can be written to 
understand the internal structure of a 
user-defined type based on large objects. 
Such functions can work with various 
portions of this structure as desired. 

Among the features of Version 2 that 
help provide for a more “active” environ¬ 
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ment and a broader range of integrity 
mechanisms are support for triggers, event 
alerters and constraints. Triggers enable 
users to instruct DB2/6000 or DB2/2 to 
automatically enforce various business 
policies whenever a given database event 
occurs (such as an attempt to insert or 
delete a row from a table, or update a 
given column of a table). Such activities 
can cause a trigger to execute, which, in 
turn, may cause some user-specified event 
to occur. This might include consulting 
data in other tables, updating other tables 
or rolling back the work. 

IBM’s trigger mechanism can also 
function as an event alerter, enabling a 
database event to cause a non-DBMS ac¬ 
tivity to automatically occur. This might 
include consulting or modifying a file, or 
sending an electronic message whenever 
someone makes a specified change to a 
named table. 

Constraints represent another integrity 
mechanism incorporated into DB2/6000 
and DB2/2 Version 2. They are useful 
for ensuring that the data values of one 
or more columns of a table fall within a 
certain range. For example, a constraint 
might ensure that serial numbers for all 
parts range from 1 to 1,000,000 or all 
salaried employees at the New York City 
office who work in department XYZ re¬ 
ceive at least a $1000 yearly bonus. 

Other object enhancements of 
DB2/6000 and DB2/2 Version 2 include 
support for recursive queries and two ob¬ 
ject-oriented programming languages. 
The latter involves support for embedded 
SQL in C++ as well as support for 
Smalltalk via Visual Age, an IBM tool that 
supports rapid application development. 

Connectivity Enhancements 

In addition to its first stage of object 
features, IBM also announced connectiv¬ 
ity enhancements for its DB2/6000 and 


DB2/2 Version 2 products. These en¬ 
hancements help improve access to both 
“live” and copied data. 

DBMS enhancements account for the 
improved access to “live” data. In this 
area, IBM has enabled its relational 
DBMSes to support distributed unit of 
work processing in a LAN-based envi¬ 
ronment. This means a single transaction 
could update multiple DB2/6000 or 
DB2/2 databases, with the DBMS auto¬ 
matically coordinating the two-phase 
commit processing required to ensure 
the integrity of the transaction. If de¬ 
sired, a transaction manager (such as 
CICS/6000) can be used instead to coor¬ 
dinate the two-phase commit processing. 

In addition, DB2/6000 and DB2/2 
Version 2 can service requests from 
other DBMSes that employ the Distrib¬ 
uted Relational Database Architecture 
(DRDA). A number of IBM and non- 
IBM DBMSes have announced or 
shipped support for DRDA “requestors,” 
including DB2 (for MVS), DB2/400 (for 
AS/400), DB2/VM (formerly SQL/DS), 
Informix (Informix Software, Inc., 
Menlo Park, CA) and Oracle (Oracle 
Corp., Redwood Shores, CA). In previ¬ 
ous releases, DB2/6000 and DB2/2 
could only make requests of DRDA-en- 
abled servers but could not respond to 
requests from DRDA-enabled products. 

While these distributed database en¬ 
hancements for “live” data access im¬ 
prove the connectivity of these DBMSes, 
a number of users have identified a need 
to maintain local copies of data that may 
be slightly out of sync with information in 
their (remote) production databases. This 
capability, which is sometimes referred to 
as “replication,” can offer certain per¬ 
formance gains and provide an effective 
alternative for managing local and remote 
resources. This is possible because net¬ 
work traffic can be minimized or more 
readily controlled (particularly if incre¬ 
mental changes to copies are scheduled to 
be sent or received only at specific inter¬ 
vals) and dependencies on remote re¬ 
sources (including network and DBMS 
software) can be kept to a minimum. 

In this area, IBM offers products that 
enable users to maintain copies of data 
stored in DB2 (on MVS and OS/400 
platforms) in their local DB2/6000 and 
DB2/2 databases. In addition, data from 
certain nonrelational sources (such as 
IMS) can be copied into DB2/6000 and 
DB2/2 tables as well. 
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7176 Pioneer Creek Road • Maple Plain, MN 55359 
Phone: (612) 932-9154 • Fai (612) 932-9155 

CIRCLE mi51 on Reader Service Card A 


DB2/6000 


Because decision support 
applications are particularly 
well-suited to a replicated en¬ 
vironment, IBM’s Data Propa¬ 
gator Relational product 
enables users to manipulate the 
data as desired before the 
changes are propagated. This 
enables users to copy only por¬ 
tions of tables, joined tables or 
the results of aggregate func¬ 
tions applied to one or more 
tables. In addition, copies can 
be scheduled to occur at spe¬ 
cific intervals (such as every 
hour or every day), and users 
may opt to have only the 
changes made since the last 
copy interval propagated. 

Parallelism And 
Performance Issues 

As with its other relational 
DBMSes, IBM has incorpo¬ 
rated support for parallelism 
into its DB2 products for AIX 
and OS/2. One aspect of this in¬ 
volves support for parallel I/O 
processing. This enables multi¬ 
ple I/O processes to be exe¬ 
cuted simultaneously, so data is 
moved more rapidly from disks 
to database buffers to help im¬ 
prove response time. In addi¬ 
tion, DB2/6000 and DB2/2 
Version 2 can perform parallel 
I/O processing for backup and 
recovery tasks, which can also 
improve response time. 

A related product, DB2 Par¬ 
allel Edition, was designed to 
exploit IBM’s POWERparallel 
System hardware and support 
read/write activities in a multi¬ 
processor environment. This 
multiprocessor environment is 
based on RISC System/6000 
technology and involves multi¬ 
ple nodes connected via a 
LAN or high-speed switch in a 
shared-nothing environment. 
Each node runs its own copy 
of the AIX operating system 
and the DBMS, and each has 
its own memory and disk(s). 
Queries can be decomposed 
into separate subtasks, with 
each node executing these 
tasks in parallel on its own 
portion of the data. 


Note that the hardware archi¬ 
tecture of this approach differs 
from that associated with IBM’s 
System/390 Parallel Query 
Server, an MVS-based prod¬ 
uct supported by DB2. The 
S/390 Parallel Sysplex (on 
which the Parallel Query Server 
runs) enables multiple nodes 
to share data on one or more 
disks through the use of high¬ 
speed fiber optic channels 
(ESCON). Each node itself 
consists of multiple processors 
configured in a shared-memo¬ 
ry environment. 

Optimization 

Enhancements 

Although support for various 
forms of parallelism can cer¬ 
tainly offer users performance 
gains, other DBMS issues must 
be considered as well. For ex¬ 
ample, query optimization can 
be critical for achieving high 
performance in a relational 
DBMS environment. This was 
one area of focus for DB2/6000 
and DB2/2 Version 2, which 
feature many optimization en¬ 
hancements, some of which 
were drawn from IBM’s 
STARBURST research project. 

One optimization improve¬ 
ment enables the DBMS to 
automatically transform many 
complex queries into more effi¬ 
cient forms, helping to ensure 
that good performance is 
achieved regardless of how the 
query is expressed. In addition, 
the optimizer has been en¬ 
hanced to consider a greater 
number of alternatives and 
more accurately estimate the 
cost of these. Because consider¬ 
ing more alternatives can in¬ 
volve increased resource usage, 
DB2/6000 and DB2/2 Version 
2 enable users to control the 
number and types of techniques 
considered by the optimizer 
when determining a reasonable 
access path to the data. This en¬ 
ables users to adjust the search 
space of optimization or spec¬ 
ify the level of optimization de¬ 
sired. By using this feature, 
users can fine tune their sys- 
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terns based on the particular queries is¬ 
sued by an application and the resources 
available on a particular platform. 

Another tuning mechanism commonly 
used is the EXPLAIN facility, which can 
help programmers and administrators 
understand the access plans selected by 
the optimizer for various queries. Under¬ 
standing the access plan selected can 
help users tune the DBMS, perhaps by 
adding an index on a table or updating 
catalog statistics. DB2/6000 and DB2/2 
Version 2 provide more detailed infor¬ 
mation through the EXPLAIN facility — 
information that is stored in tables to 
provide easy access. This EXPLAIN 
statement will support the syntax of the 
DB2 (MVS) EXPLAIN statement. 

In addition, Version 2 enables users to 
update certain catalog statistics that in¬ 
fluence optimization, changing these 
statistics and reviewing subsequent ac¬ 
cess plan information to perform “what 
if’ analysis on their queries. This capa¬ 
bility also enables users working in a test 
environment to enter statistics they be¬ 
lieve would be more characteristic of a 
production environment, thereby giving 
them a chance to make some early per¬ 
formance estimates and experiment with 
various alternatives. 

Other performance enhancements in¬ 
clude a high-speed LOAD utility, addi¬ 
tional database monitoring capabilities, 
additional support for prefetching data 
and support for tablespaces. The latter 
feature also offers administrators more 
control over the placement of their data 
as well as the ability to back up and re¬ 
store their data at a more granular level 
(at a tablespace level). • 

ABOUT THE AUTHOR 

C.M. Saracco has 10 years experi¬ 
ence working with both object and 
relational databases. Together with 
Charles Bontempo, she is currently 
writing a book on database topics to 
be published by Addison-Wesley. She 
works at IBM Software Solutions Di¬ 
vision, P.O. Box 49023, Office C360, 
San Jose CA 95161, (408)463-3107. 

Charles J. Bontempo has more than 
30 years experience in the database 
arena. He teaches and consults on 
database topics for IBM and at the 
Polytechnic University in New York. 
IBM Software Solutions Division, Route 
100, Office 4D-06, Somers, NY 10589, 
(914) 766-1394. 



Why gamble with performance when you can have a sure bet with 
Performance Solution™. You’re guaranteed a big pay-off as you: 

• Dynamically tune your 
VSAM, QSAM, and BSAM 


Select optimal buffering techniques 
based on file processing 


processing 

Cut load times and speed 
VSAM file reorganizations 

Customize performance 
requirements for processing 


• Automatically adjust region 
sizes to accommodate optimized 
buffer specifications 

• Install without program or 
JCL changes 


Performance Solution maximizes system-wide performance, saving 
you time, money and technical resources. 

Softworks’ Performance Solution guarantees you a winning hand 
every time. Call Softworks today and find out more about what 
Performance Solution can do for you and your data center. 


SOFTWORKS 

Think Solutions. Think Softworks. 

7700 Old Branch Ave., Clinton, Maryland 20735 
U.S.A. 1-800-727-4422 •International 1-301-856-1892 


ENTERPRISE SYSTEMS JOURNAL • SEPTEMBER 1994 


CIRCLE #147 on Reader Service Card A 


39 





































